Learning From Demonstrations via Structured Prediction
نویسندگان
چکیده
Demonstrations from a teacher are invaluable to any student trying to learn a given behavior. Used correctly, demonstrations can speed up both human and machine learning by orders of magnitude. An important question, then, is how best to extract the knowledge encoded by the teacher in these demonstrations. In this paper, we present a method of learning from demonstrations that leverages some of the structured prediction techniques currently under investigation in the literature. We report encouraging results in Wargus, a real-time strategy game.
منابع مشابه
An investigation of imitation learning algorithms for structured prediction
In the imitation learning paradigm algorithms learn from expert demonstrations in order to become able to accomplish a particular task. Daumé III et al. (2009) framed structured prediction in this paradigm and developed the search-based structured prediction algorithm (Searn) which has been applied successfully to various natural language processing tasks with state-of-the-art performance. Rece...
متن کاملRobobarista: Object Part Based Transfer of Manipulation Trajectories from Crowd-Sourcing in 3D Pointclouds
There is a large variety of objects and appliances in human environments, such as stoves, coffee dispensers, juice extractors, and so on. It is challenging for a roboticist to program a robot for each of these object types and for each of their instantiations. In this work, we present a novel approach to manipulation planning based on the idea that many household objects share similarly-operate...
متن کاملRobobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding
There is a large variety of objects and appliances in human environments, such as stoves, coffee dispensers, juice extractors, and so on. It is challenging for a roboticist to program a robot for each of these object types and for each of their instantiations. In this work, we present a novel approach to manipulation planning based on the idea that many household objects share similarly-operate...
متن کاملPredict and Constrain: Modeling Cardinality in Deep Structured Prediction
Many machine learning problems require the prediction of multi-dimensional labels. Such structured prediction models can benefit from modeling dependencies between labels. Recently, several deep learning approaches to structured prediction have been proposed. Here we focus on capturing cardinality constraints in such models. Namely, constraining the number of non-zero labels that the model outp...
متن کاملBipedal Locomotion Primitive Learning, Control and Prediction from Human Data
At the current stage bipedal robot locomotion is quite different from human walking. Imitation learning framework from human demonstrations is an efficient approach to lead towards human-like behaviors. This paper addresses a framework for real-time wholebody human motion imitation by a humanoid robot. The framework is a structured mixture of whole body motion control, learning and prediction. ...
متن کامل